METHOD AND APPARATUS FOR ENCODING AND DECODING AN OBJECT 

BACKGROUND OF THE INVENTION 



1. Field of the Invention 

The present invention relates to an object encoding and 
decoding technique. It particularly relates to a method and 
an apparatus for encoding and decoding an object, that has a 
surface described as a polygon mesh. 

2. Description of the Related Art 

The widespread use of the Internet gives an opportunity 
for people to access an enormous amount of information without 
any special facilities. As this convenience has become 
understood, electronic commerce using the Internet has 
developed rapidly. In particular, online shopping in which 
orders of commodities are made based on published catalogs, 
has lately drawn public attention as one of the attractive 
uses of electronic commerce. 

One of the most important points of online shopping is 
how to show commodities attractively. To show a photograph of 
the commodity can be effective for this purpose, but when 
users want to look at the commodity from different angles, it 
is hard to meet such a requirement with a 2 -dimensional 
photograph. Therefore, as a recent trend, commodities are 
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often modeled as polygon data so that the users can freely 

manipulate the 3 -dimensional model through magnification, 

rotation, and the like. 

However, when commodities are represented as polygon 

data, the amount of data required raises problems. The 
polygon data required for one commodity could be several 
megabytes if designed to show the fine structure of the 
commodity. Such large data sizes are difficult to manipulate 
and users cannot have the same feeling as when they look at 
paper catalog pages or the like. The success of online 
shopping depends on how quickly commodity image can be 
transmitted and displayed and how the image data and 3- 
dimensional information can be stored in a limited space of 
storage devices. 



SUMMARY OF THE INVENTION 

The present invention has been made in view of the 
above-mentioned problems. One of the objects of the invention 
is to provide a technology for encoding and decoding polygon 
mesh data of objects such as commodities. Another object of 
the invention is to provide a technology for providing useful 
indices for the object at the same time when its polygon data 
are compressed. Yet another object of the present invention 
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is to provide a technology for encoding and decoding the 
polygon data by which the encoded polygon data can be decoded 
in a short time. Still another object of the present " 
invention is to provide a technology for properly encoding and 
decoding the polygon data with a fine structure that may not 
exist in reality. 

According to one aspect of the present invention, an 
object encoding method is provided. The method comprises 
obtaining a description of a surface of an object, defining an 
origin on the surface, decomposing the surface into a 
plurality of independent shape components according to a 
distance from the origin to a point of the surface, and 
encoding the shape components. If the object is described as 
a polygon mesh, the origin may be a predefined base vertex in 
the polygon mesh, and the distance may be a graph distance 
from the base vertex to a vertex of the polygon mesh. 

The graph distance may be defined, for example, as the 
number of edges or the distance along a path from a base 
vertex to another vertex. In particular, in some cases, such 
as where the distance is defined as the number of edges, the 
distance may be represented by a natural number. The shape 
components may include a contour graph that is a set of edges 
that connect vertices that have the same graph distance. 
Roughly speaking, the shape components can be independent 
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regions each of which is a slice of the object sliced starting 
from the base vertex. Once the distance, particularly a graph 
distance, is introduced, the object can be easily decomposed 
into the shape components uniformly. Furthermore, since each 
of the shape components is an independent region, it can be 
easily encoded. The shape components include global 
topological information of the object. Here, global 
topological information is intended to exclude local 
topological information such as connectivity information 
between polygon vertices. Examples of global topological 
information are a structural graph or a skeleton structure 
obtained on the basis of the graph distance. The global 
topological information is useful as an index of the object 
for searching purposes . 

The encoding the shape components may include encoding 
geometrical information of the object and encoding local 
topological information of the object. An example of local 
topological information is connectivity information between 
polygon vertices. The encoding of the local topological 
information may include a description indicating that the 
object is a non-manifold when a shape represented by the 
polygon mesh is a non-manifold. In this case, the description 
may describe the number of sets of polygons such as triangles 
around a vertex that characterizes the non-manifold. The 
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description may also describe the number of planes around a 
vertex that characterizes the non-manifold. 

The encoding the geometrical information may be adapted 
to a local size of the polygon mesh. As an example, the 
5 difference between a predicted value and a real value of a 
vertex under consideration may be encoded by an entropy 
coding. The difference may be adjusted so that the entropy 
coding can be optimized. The geometrical information may 
include information relating to color or a normal of a vertex 

10 and a plane. In this case, the difference between a predicted 
value and a real value of such information may be encoded by 
the entropy coding. 

According to another aspect of the present invention, an 
object decoding method is provided. The decoding method may 

15 be used in combination with the object encoding method of the 
present invention. The method comprises obtaining encoded 
data which describes a surface of an object, extracting a 
plurality of independent encoded shape components which are 
encoded after being decomposed according to a distance from an 

20 origin of the surface, which is included in the encoded data, 
to a point of the surface, and decoding the encoded shape 
components . 

According to still another aspect of the present 
invention, an object encoding apparatus is provided. The 
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apparatus embodies the above-mentioned object encoding method 
and comprises a unit which obtains a description of a surface 
of an object, a unit which defines an origin on the surface, a 
unit which decomposes the surface into a plurality of 
5 independent shape components according to a distance from the 
origin to a point of the surface, and a unit which encodes the 
shape components. 

According to still another aspect of the present 
invention, an object decoding apparatus is provided. The 

10 apparatus embodies said object decoding method and can be used 
in combination with said object encoding apparatus. The 
apparatus comprises a unit which obtains encoded data which 
describe a surface of an object, a unit which extracts a 
plurality of independent encoded shape components which are 

15 encoded after being decomposed according to a distance from an 
origin of the surface, which is included in the encoded data, 
to a point of the surface, and a unit which decodes the 
encoded shape components. 

According to still another aspect of the present 

20 invention, an object encoding method is provided. The method 
comprises obtaining an object, defining a function relating to 
a distance on a surface of the object, obtaining a structural 
graph of the object on a basis of a value of the function, and 
encoding the object in such a form that the structural graph 
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is included. The structural graph may be explicitly or 
implicitly included in the encoded data. The encoded object 
data may be in such a form that the structural graph can be 
reconstructed or referred to for a future use. When the 
object is represented as a polygon mesh, the function may 
output the number of polygon edges along the path from a 
predefined base vertex in the polygon mesh to a vertex of the 
polygon mesh. 

Moreover, any arbitrary combination of the above- 
mentioned structural components in the present invention is 
still effective as an embodiment when applied as a method, a 
system, a server, a terminal, a computer program, and so 
forth. 

This summary of the invention does not necessarily 
describe features that must be included. The invention may 
also be a sub-combination of these described features. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above and the other objects, features, and 
advantages, will become more apparent from the following 
description of preferred embodiments taken in conjunction with 
the accompanying drawings wherein: 

Fig. 1 shows the relation among indices of 
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singularities, k-cells, and objects to be coded with them; 

Fig. 2a shows one pair of surfaces with the same 
sequence of Morse indices; 

Fig. 2b shows another pair of surfaces with the same 
sequence of Morse indices; 

Fig. 2c shows still another pair of surfaces with the 
same sequence of Morse indices; 

Fig. 3a illustrates the relation between a torus and its 
critical points; 

Fig. 3b illustrates the relation between a torus and its 
cross sectional contours; 

Fig. 3c illustrates the relation between a torus and its 
Reeb graph; 

Fig. 4a shows a parent-child relation of contours; 

Fig. 4b shows a tree structure of contours having a 
parent-child relation; 

Fig. 5 is a diagram explaining a method for encoding a 
torus using operators; 

Fig. 6 shows an example of programming operators in 
pseudo-Pascal code; 

Fig. 7 shows an example of programming operators in 
pseudo-Pascal code; 

Fig. 8 shows an example of programming operators in 
pseudo-Pascal code; 
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Fig. 9 shows icons corresponding to cells; 
Fig. 10 shows the pasting of cells; 

Fig. 11 shows an example of the Reeb graph of an object 
constructed with icons; 
5 Figs. 12(1), 12(2), 12(3), 12(4), 12(5), 12(6), 12(7), 

12(8), 12(9), 12(10), 12(11), and 12(12) show the cross 
sectional contours of an object; 
Q Fig. 13 is a list of operators for constructing an 

=5 object; 

"^^'10 Fig. 14 shows a homotopic transformation of contours; 

l~ Fig. 15 shows four main parameters of an operator; 

Fig. 16 shows gradual transformation of an upper contour 
s,j= into a lower one using a guiding curve; 

Q Fig. 17 shows various types of shape components of an 

15 embodiment; 

Fig. 18 shows a marching pattern; 
Fig. 19 shows a triangle strip of an annulus; 
Fig. 20 shows a triangle strip of a 2 -cell; 
Fig. 21 shows shape components constructed when a single 
20 base vertex is specified; 

Fig. 22 shows shape components constructed when several 
base vertices are specified; 

Figs. 23(a) to 23 (t) show how connectivity information 
is encoded; 
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Fig. 24 shows one example of a non-manifold objects- 
Fig. 25 shows another example of a non-manifold object; 
Fig. 26 illustrates how the connectivity information is 
encoded by using a toroidal graph; 
5 Fig. 2 7 shows a compression method using connectivity 

prediction; 

Fig. 2 8 is a block diagram showing an object encoding 
apparatus of the embodiment; 

Fig. 2 9 is a block diagram showing an object decoding 
10 apparatus of the embodiment. 

DETAILED DESCRIPTION OF THE INVENTION 

Although the preferred embodiments of the present 
15 invention will be described herein, the true scope of the 
present invention is broader than that of these preferred 
embodiments and is more particularly described in the claims. 
Prior to describing the embodiments, the following "base 
techniques" are introduced in order to help explain the 
20 subject area. These "base techniques" are based on a 
dissertation given by Dr. Yoshihisa Shinagawa of Tokyo 
University, in 1993. The full dissertation may be helpful for 
understanding the present invention. 
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Base Techniques 

[1] Surface Coding Based on Morse Theory 

(1) Introduction 

The shape of a solid or a surface in 3D space is 
5 generally represented as a sequence of symbols for coding. 

The coding herein means an object expression using necessary 
information for modeling an object. In the case of natural 
objects, a shape can have so many degrees of freedom that 
coding them demands certain simplification. Topology is a 
10 mathematical means for performing such simplifications. 

3D objects may be interpreted using Morse theory as a 
mathematical tool. As it is described later, the objects can 
be modeled very effectively and consistently. However, Morse 
theory alone is not sufficient for precise 3D surface coding. 
15 The following explains the reasons for this, and describes a 
way to solve the problem by developing some effective 
extensions to Morse theory. 

(2) Classical Morse theory 

Morse theory was primarily motivated by the calculus of 
20 variations, where the problem is describing the minima of a 
functional on an infinite dimensional space of paths. 
Conversely, one can also use the minima of a functional to 
describe some topological features of spaces that are 
otherwise hardly describable. The following is a brief 
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overview of Morse theory. 

1. Dif f erentiable manifolds 

The spaces to which Morse theory can be applied are 
dif f erentiable manifolds. Finite dimensional manifolds will 
5 next be considered. 

Given an integer n, an n-dimensional manifold is a 
topological space such that any point has a neighborhood that 
can be mapped, one-to-one and bicont inuously , on a subset of 
the n-dimensional space R^. Such a mapping is called a chart 
10 and provides coordinates for the points in its domain, which 
correspond to latitude and longitude in the case of the globe. 
For the manifolds to be p times dif f erentiable , the 
transformation from one coordinate system to another should 
also be p times dif f erentiable for points in the range of two 
15 different charts. 

One can thus see a manifold as being made of pieces of 
R'^ overlapping in a dif f erentiable way. For instance, a line 
and a circle can be given with structures of 1-dimensional 
manifolds, and the surface of a sphere can be made a 2- 
20 manifold using at least two charts. One can make the surface 
of a torus a 2 -manifold using at least four charts. Removing 
knotted circles from R^ gives examples of 3 -manifolds, and 
higher dimensional manifolds appear concretely as, for 
example, configuration spaces of robot arms. 
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2 . Dif f erentiable mapping and singularities 

Using charts, mapping from a p-manifold to an n-manifold 
can be numerically expressed as mapping from pieces of to 
pieces of R"^. For this mapping, one can check 
5 differentiability: A mapping is of class c"^ if its components 
are k times continuously dif f erentiable . 

Let's define a height function on coordinates, which 
returns the height of a given point (such as z-coordinate of 
an object embedded in 3D-space) . Jacobian matrix of a height 
10 function h: R^ -> R is given as 

~ dh 

One can compute a Jacobian matrix at each point. Its rank is 
at most min(n, p) . Points at which the rank of the Jacobian 
matrix reaches this maximal value are called "regular points", 

15 while the others are called "singular points" or "critical 
points" . Singular points concerning the height function 
include "peak points" ("maxima"), "saddle points" and "pits" 
("minima") . In other words, at singular points, the vector of 
Jacobian matrix becomes zero, and the normal vector is 

20 oriented in the same direction as that of height . 

3 . Hessian matrix and index 

For a mapping from an n-manifold to R (we call such mapping 
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"functions on a manifold"), a point is critical if and only if 
all partial derivatives are null. At such a point the 
aforementioned function is thus approximated by a quadratic 
form based on second order partial derivatives, the matrix of 
5 which is called the Hessian matrix, and its components are 
described as follows: 













dx^dx^ 










dx^ 


dx^dx^ 






av 




dx^dx^ 


dxl 



The number of negative definite Hessian matrix at the 
singular point is called the index of the singularity. As 
ilJlO shown in Fig. 1, the index is also equal to the number of 
minus signs in the reduced form. Indices of peak point, 
saddle point and pit are 2, 1 and 0, respectively. As shown 
in Figs. 3a-3c, a torus has one, two and one critical points 
having indices 2, 1 and 0, respectively. 
15 A singularity is not degenerate if the Hessian matrix 

has rank n at that critical point, and it is then called 
"nondegenerate" . Any function can be approximated by a 
Morse function. A function on a manifold is called a Morse 
function if none of the singularities are degenerate. 
20 Therefore, the critical points of a Morse function are 
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isolated, so for a compact manifold, only a finite number of 

them exist . 

4 ■ Homotopy types 

Given a manifold and a Morse function on it and knowing 
5 the sequence of the indices of that function's singularities, 
Morse theory shows that a topological space with the same 
homotopy type as the manifold can be constructed as a cell 
complex after a sequence of operations corresponding to the 
singularities of the Morse function. 

10 For any real number representing a height, the cell 

gives a model for the part of the manifold whose points are 
below that heights. As R is scanned from top to bottom, the 
topology of the complex does not change between two successive 
singular values, but each time a singular value is crossed, 

15 the cell complex is updated by attaching a k-cell to the 

former complex, where k is the index of the singularity. In 
brief, the shape of an object can be retrieved by attaching 
cells with the same dimensions as those of critical points. 
Fig. 1 shows the relationship among indices of 

20 singularities, k-cells and an object to be coded by them. 

Here, the object is represented by a torus. As shown in Fig. 
1, each time the threshold crosses critical heights, the 
topology of the set of points below the threshold changes. 
This change is topologically equivalent to attaching a k-cell. 
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As shown in this Figure, 2-cell (k=2) , 1-cell (k:=:l) , and 0- 
cell {k=0) can be represented by shapes such as a bowl laid 
upside down, a string, and a single point, respectively. The 
shape of the original object can be retrieved by attaching 
5 those cells together to build a cell complex followed by 

transforming it as in clay works. For example, a torus can be 
built by attaching one 2-cell, two 1-cells, and one 0-cell. 
Q Note that the mere sequence of indices does not 

VB completely describe the cell complex. Figs. 2a-2c show three 
^yiO pairs of surfaces with the same sequences of Morse indices, 
respectively. Thus, with the sequence of indices only, the 
cell complex cannot be completely determined. Therefore, one 
IS. must know which connected components of the cross section will 
Hp be involved in attaching each cell. 
15 G. Reeb proposed that one can consider a graph obtained 

from the manifold as a topological quotient space. The Reeb 
graph is to represent the mutual relationship of singularities 
and is obtained by representing the object's surface with 
contour lines and the connected components of each coordinate 
20 with a single point. He did this by identifying in the 

manifold (assumed to be compact) all the points that have the 
same value under the Morse function and that are in the same 
connected component as being a corresponding cross section. 
That is, the connected components of the part of the manifold 

MN-70005 



17 



situated strictly between two critical levels are thus 
represented by separate line segments- -the edges of the graph. 
Each singularity corresponds to a vertex of the graph. The 
Reeb graph can be said to be a graph representing the skeleton 
5 of an ob j ect . 

Figs. 3a-3c show the relationship between a torus and 
its Reeb graph. Fig. 3a shows an original torus. Fig. 3b its 

O cross sections, and Fig. 3c its Reeb graph, respectively. 

Disjointed cross-sectional circles on the same plane in Fig. 

•^^'10 3b correspond to the separate two edges in Fig. 3c. 

(3) Limits of the theory 

Importantly, using Morse theory in this classical way 
i>=i permits only the recovery of the manifold's intrinsic 

topological properties. The sequence of indices does not code 
15 the way the manifold is embedded in space. For instance, you 
cannot determine whether an embedded torus is knotted in 
space, since both knotted and unknotted configurations can 
lead to the same singularities, as shown in Fig. 2b. The 
existence of links (shown in Fig. 2c) is another feature 
20 overlooked by simple Morse coding. 

(4) Extending Morse coding 

This section is restricted to the case of surfaces 
associated with compact 2 -manifolds of class that are 
embedded in 3D space. It is desirable that this Morse 
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function on the surface is induced by a height function in 

space. In fact, it is actually sufficient to slightly rotate 

a surface to have singularities become nondegenerate , 

thereby enabling its height function to become Morse. 

5 Morse theory also states that between two critical 

levels, that is, heights of the planes containing critical 

points, the topological type of the section does not change. 

i:3 Thus, a number of more or less bent cylinders can be used to 

-.0 model the surface between two such levels. Since nonsingular 

■'^10 cross sections are made of circles embedded in a plane, the 
I'll 

;~ possible nested configuration call for hierarchical and 

1^^ structural coding- -grouping plural circles that are contained 

in the same circle. 
□ Besides information related to the Reeb graph, that is, 

15 on how vertices are connected to each other, a proposed 

extended coding also contains other information in addition to 
the Morse index- - information about the way cylinders are 
exchanged and turn around each other between two successive 
critical values. 
20 (5) Coding system example 

A prototype coding system example will next be 
explained. This coding system describes a surface by 
attaching a k-cell and tracking the changes in the 
hierarchical structure of contours on cross - sect ional planes. 
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Operators are introduced to describe the attachment of cells 
and code the surface. The iconic representation of cells 
attached by the operators enables easy understanding of the 
structure of the surface to be coded. The major advantage of 
5 this coding system is that the topological integrity is 
guaranteed by the resulting code. 
1. Contours with parent-child relations 

p The method for representing the hierarchical structure 

of contours will next be described. Here, a tree structure is 

= ^10 used. 

=S When a contour contains another contour, the former is 

called the parent contour of the latter and the latter is 
1.=;= called the child contour of the former. Figs. 4a and 4b show 
Q the parent-child relation of contours and its tree structure 
15 representation, respectively. The parent-child relation can 
be nested, as shown in Fig. 4a. In the following #N 
represents contour number N. For example, contour Number 1 is 
abbreviated #1. #1 is the parent contour of #2, which in turn 
is the parent contour of #4. For convenience, contours that 
20 have no parents, such as #1 and #7, are described as the child 
contours of "virtual contour #0". Therefore, #0 comes to the 
top of the tree structure. 

We define an array parent # [ ] to show the parent of the 
given contour. In the example above, parent# [1] =0 . On the 
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other hand, the child contours of a contour are listed in an 
array children [ ] , to which a pointer is contained. For 
example, the pointer to the child contours of #3 is contained 
in children [3] and the child contours #5 and #6 are 
5 represented as 

children[3] t [1] = 5 and children[3] t [2] = 6. 

For convenience, contours having the same parent 
contours are called sibling contours. In Fig. 4a, #2 and #3 
are sibling contours. The parent's parent contour is called 
[plO the grandparent contour and the child's child contour is 
Iji called grandchild contour. Also, a contour in whose interior 
the object exists is called a solid contour and a contour in 
whose interior the object does not exist is called a hollow 
;'3 contour. In Fig. 4a, #1, #4, #5, #6, and #7 are solid 
15 contours, while #2 and #3 are hollow contours. 
2. Operators for attaching cells 

There are four defined operators: Put_eO, Put_el_merge , 
Put_el_divide , and Put_e2 . These paste the cells. In what 
follows, a k-cell is abbreviated to e^. 
20 Construction proceeds from the top to the bottom of an 

object. The process ends when it is no longer possible to 
attach cells. The contour structure on the cross-sectional 
planes is used to represent the status of the surface which 
the operators are constructing. 
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Fig. 5 shows methods for constructing a torus using 
these operators. In the following, functions of the operators 
will be described with reference to this Figure. 

1. First, Put_e2 (0) is used to create a cell e^ as shown 
5 at the top of Fig. 5. The parameter "0" in Put_e2(0) means 

that #1 is created inside #0, which is the virtual contour. 

The cross section of this cell is shown under "Sectional 
i;3 Description" of Figure 5. As shown in Figure 5, Put_e2 has 
•^fl the function to create a contour on a cross-sectional plane. 
^^^10 All the contours the operators create are numbered in the 

order in which they are created. The contour created by 
L Put_e2 (0) is #1 . 

The status of a newly created contour is always 
p "enabled" at first. This means that cells can be attached to 

15 the contour. We show our iconic representation of the cell e^ 

under "Icon" in Fig. 5. 

2. A cell e-^ is pasted to e^ by Put_el_divide (1, nil, 
inside) . A newly created contour is numbered as #2 . The 
third parameter "inside" indicates that #2 is created as the 

20 child contour of parent# [ 1 ] =0 , which is the virtual contour. 
The second parameter is the list of child contours to be 
operated on. In this case, the second parameter is "nil", 
which indicates that no operation is performed on child 
contour, consequently, no child contour is to be removed. 
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3 . Another is pasted to merge #1 and #2 by 
Put_el_merge (1, 2) . This operator merges the two contours 
designated by the first and second parameters into the first. 
Since the contour indicated by the second parameter disappears 
5 by merging, the second contour is removed from the list of 
child contours of its parent contour. The status of the 
second contour is changed from "enabled" to "disabled" . 
i'3 Therefore, no cell can be attached to this contour. 
v3 4. Finally, Put_eO (1) eliminates #1 by pasting e° . The 

^210 status of #1 is changed from "enabled" to "disabled" . The 
If, icon reflects this change. Note that when a contour is 

attached to a cell e°, all of its child contours, if any, have 
^= to be disabled beforehand. 

P The cell attaching procedure ends here because no 

15 enabled contours remain. 

Figs. 6-8 show examples of operator programs in pseudo- 
Pascal code. Max_children and Max_contour_number are defined 
here as sufficiently large positive integers. They are used 
only for memory allocation purposes. 

20 In Fig. 6, Number_of _children shows the number of child 

contours for each contour. Most_recent ly_created# shows the 
most recently created contour number. Contour_status shows 
whether each contour is enabled or disabled. Children gives 
the pointer to the array Child_list that contains the list of 
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the child contours of a given contour. The end of Child_list 
is shown by the constant End_of_list. These variables are 
initialized as follows: 

most_recently_created# : =0 ; 
5 for i:=0 to max_contour_number do 

number_of_children [i] :=0; 
for i:=l to max_contour_nuiTLber do 
fj contour_status [i] : =disabled; 

contour_status [0] : =enabled; 
^-10 Thus, the virtual contour here becomes enabled. Next, 

create_new_contour creates a new contour, increases most 
Li= recently_created#, and initializes its status. 

Procedure create_new_contour ; 
^3 begin 

15 most_recently_created# : =most_recent ly_created#+l ; 

contour_status [most_recent ly_created#] :=enabled; 
end 

In Fig. 7, we define two procedures and three functions 
for later use. add_listed_children adds the contours listed 
20 in the second parameter, clist, to the list of the child 
contours (children [n] T ) of the first parameter. 
remove_listed_children removes the contours listed in the 
second parameter, clist, from the list of the child contours 
(children [n] t ) of the first parameter. These two procedures 
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also update the array NuTiaber_of _children and Parent#. 

On the other hand, the function are_children (n, clist) 
returns "true" when all the contours in clist are child 
contours of #n. Otherwise it returns "false" . The function 
5 in_list (n, clist) returns "true" when clist includes #n. 
Otherwise it returns "false" . The function 

list_containing_only (n) is defined to make a list containing 
O only one contour to be fed to the procedures 

add_listed_children and remove_listed_children . 
^JLO The four operators Put_e2 , Put_eO, Put_el_divide , and 

;S Put_el_merge can now be defined. They are shown in Fig. 8. 

1 . Put_e2 (n) creates a new contour as the child contour 

S of #n. 

O 2. Put_eO (n) eliminates #n by attaching e° . Here, 

15 All_successors_disabled (n) returns "true" if and only if all 
the successors of #n are disabled. 

3. Put_el_divide (n, clist, inside) divides #n to create 
a new contour. The contours listed in clist become the child 
contours of the newly created contour. They are removed from 

20 the list of the child contours of #r where either #r=#n or 

#r=parent# [n] holds. (This depends on whether they have been 
the child contours of #n or parent# [n] . ) Only these two cases 
are permitted. 

4. Put_el_merge (cl, c2) merges #c2 into #cl . #c2 is 
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removed from the list of child contours of parent#[C2] . All 
the child contours of #c2 become the child contours of 
parent#[cl] or #cl, depending on whether #cl is the parent or 
the sibling contour of #c2 . Only these two cases are 
5 permitted. 

To easily understand the structure of the surface being 
coded by these operators, we propose a graphic representation 
O of the cells that constitute the Reeb graph of the surface. 

The icons depicted in Fig. 9 represent cells. We paste 
two cells by letting the flat top and the flat bottom of the 
;S two coincide. We depict the cells for hollow contours with 
M= white icons and cells for solid contours with black icons. 
|,,i= For e^, plural types of icons exist. 

\3 Fig. 10 shows the attachment of cells. The icons of 

15 child contours are drawn inside of the icons of their parent 

contours, as shown in this Figure. Mirror images of the icons 
on the vertical axis are also allowed. 

Icons are characteristic in that they preserve the 
contour structure to which they are attached. For example, 
20 when e^ is attached to the cells as shown in the top half of 

Fig. 10, the icon preserves the structure of the parent-child 
relation of the cells to which it is attached. 

Dummy icons are inserted where necessary to adjust the 
height of cells, as shown in the bottom half of Fig. 10. 
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Because the Reeb graph may not be a planar graph, we can use 
the dummy icons to interchange the cells so that e^ can be 
attached to the adjacent cells. e^ interchanges the cells 
together with its inner structure. To preserve the 
5 hierarchical contour structure, the dummy icons cannot exceed 
the bounds of the parent contour of the contour to which it is 
attached or intrude into other contours. For this reason, 
l;3 only sibling contours can be interchanged using dummy icons, 
'fl In summary. Figs. 11-13 illustrate an example of the 

^io code for constructing an object using the operators. Fig. 11 
shows an example of the Reeb graph of an object using icons. 
Figs. 12 (1), 12(2), 12(3), 12(4), 12(5), 12(6), 12(7), 12(8), 
U 12(9), 12(10), 12(11), and 12(12) are the corresponding cross- 

Q sectional contours, and Fig. 13 shows the operators for 
15 constructing the object. 

[2] Surface construction from codes using operators 
(1) Surface generation as a locus of homotopy 

In this section, surface is constructed from codes 
obtained by the previous method. 
20 As was previously stated, the topology of a contour does 

not change between critical sections. When scanned from top 
to bottom, the shape of a contour changes. This 
transformation of a contour can be best represented by the 
notion of a homotopy. A homotopy transforms one function to 
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another. In what follows, all the contours are to be 
represented by shape functions and the transformation are 
described by means of homotopies. The definition of homotopy 
is as given below. 
5 Definition: Let f, g : X-^ Y be maps where X and Y are 

topological spaces. Then "f is homotopic to g" if there 
exists a map F; X x Y such that 

? Fix,0) = fix) 

n and 

ry 

■.,pl 0 F(x, 1) = g(x) for all points x&X. 

Here I = [0,1]gR. This map F is called "a homotopy from f 
to g" . When F is defined by 

£: F(x,t) = (l-t)f(x) + tg(xl 

1-^ it is called a straight-line homotopy. In Fig. 14, homotopic 
15 transformation of contours is shown. In this Figure, the 

upper contour is represented by the shape function f and the 
lower by g. The surface generated is the locus of the 
homotopy F from f to g. 

(2) Elements for implementing operators 
20 The operations that generate a surface as shown in Fig. 

14 are described as transforming the contours by homotopies. 
1 ■ Elements of the operators 

The following four main elements of the operators are 
described in Fig. 15. 
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(i) f I-^ gives the shape of the upper contour 

(ii) g: I-^ gives the shape of the lower contour 

(iii) F: the homotopy from f to g 

(iv) h: the difference in height between the two contours 
5 2 ■ Shape functions 

The following shape functions f and g are provided, 
(i) point: a constant function that always gives the location 
of a fixed point 
y% (ii) circle: gives the shape of a circle 

[fiLO (iii) polygon: gives the shape of a polygon that connects the 
l.n given vertices 

r== (iv) Bezier: An n- dimensional Bezier curve is written as 



and specified by an ordered set of n points called the control 
15 points PieR-'. The control point can be modified by users. 
Here, B'^i(t) is a Bernstein basis function defined by 



(v) NURBS (Non Uniform Rational B-Spline) curves: The control 
points of NURBS curves are also defined by users. A NURBS 
20 curve is defined by 



f(u) = f^B:'(u)P, {0<t<l) 
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where Wi is a weight associated with each control point. 
Ni^k(t) is a piecewise polynomial of degree (k-1) called a B- 
spline basis function and defined as 

''° [ 0 otherwise 

5 N,,(u) = - ' IV ' (2<k<n + l) . 

NURBS is extensively used in a great number of CAD 
systems. It can represent a quadric surface accurately and 
iil has a local approximation property, that is, if a control 
ill point or the weight associated with it is changed, it affects 

10 the shape of the surface only in its neighborhood. 
'■■"^ 3 . Homotopy F 

:;j As the homotopy F, functions are provided as follows. 

They output cross sectional contours, 
(i) linear: a straight-line homotopy 
15 (ii) quadrant: 

F(x, t) = quadrantfic,^; = -^l-t^ f (x) + (l - ^l-t^]g{x) 

(iii) parabola: F (x, t ) = (1-t^) f (x) +t^ g (x) 

(iv) cardinal spline: A cardinal spline interpolates the upper 
and lower contours. Parameterization can be done 

20 automatically using a toroidal graph that indicates the 
correspondence between the contours . 

(v) guiding curve: A contour is transformed by moving a point 
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of the contour along a guiding curve. More than one guiding 
curve can be attached to a contour. When a contour is 
represented by a Bezier or NURBS curve, the guiding curves may 
be attached to the control points and the transformation may 
be determined by the movement of the control points. The 
movement of a control point where a guiding curve is not 
attached can be computed using the guiding curves of its 
adjacent control points. 

Fig. 16 shows the gradual transformation of the upper 
contour to the lower one by attaching a guiding curve to a 
contour by users. 

In any case, surface patches are generated among the 
resulting contours using the cardinal spline. 
4 . Shape functions of contours for the operators 

The shape functions of contours for the operators are 
given in the following. 

(i) Put_e2 
f : point 

g: specified by users (default: circle) 
F: specified by users (default: quadrant) 

(ii) Put_eO 

f: the shape function of the contour to which the e° is 
attached 
g: point 
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F: specified by users (default: quadrant) 
(ill) Put_el_divide and Put_el_merge 

For Put_el_divide and Put_el_merge , the path c of e"^ : [0, 
IJ-s-R^ of must be determined. In the implementation, the 
path c is specified by the locations of c(0), c(l/2) and g(1). 
The path has to be smooth and its tangent vector at c(l/2) has 
to be parallel to the xy-plane so that c(l/2) becomes the 
saddle point of the generated surface. The default location 
of c (1/2) is at 



c(\l2)-- 



<0).+c(l). 
2 

c(0),+c(l)^ 



2 

c(0), -h 



c{t)- 



c(t) 

U(0 



The default path is an arc of an ellipse connecting c(0) 
and c(l) given by 

{ (l - Vl-(20']b(0), + Vl-(20'c(l/2), ] 

cit)-- 



(l - Vl-(20'>(0)^ + Vl-(20'c(l/2)^. 
(l-20c(0), +2te(l/2)^ 



for 0<t<l/2 and 

\ - Vl-(2-204(l/2), + Vl-(2-2/)^c(l), ' 
c(0 = (l - Vl-(2-20'jb(l/2)^ + 4l-{2-2tfc{\)^ 
(2-20c(l/2),+(2r-l)c(l), 
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for l/2<t<l. The projection of the default path to the xy- 
place is the line segment connecting c(0) and c (1) . A 
parabolic path can easily be provided by substituting (1-x^) 
for The elements for Put_el_divide and Put_el_merge 

5 are as follows. 
•Put_el_divide 

f: the shape function of the contour to which e^ is attached 
c : the path of e^ 

c(0) and c (1) : specified by Si and S2 e [0, 1] where 
Jmo c(0)=f(si) and c(l)=f(s2) respectively 

in gi and ga : obtained by dividing the contour along c 

•Put_el_merge 

c : the path of e""" 

fi and f2 : the shape functions of the contours to which e""" 
15 is attached 

c(0) and c(l) : specified by Si and S2 e [0, 1] where c(0)=fi 
(si) and c(l)=f2 (S2) respectively 

g: obtained by merging the contours along c 

The transformation for Put_el_divide is performed by 
20 transforming the contour using the path of e"*" as the guiding 
curve ; i.e., 

F(si, t)=c(t/2) 

and 

F (S2, t) =c (l-t/2) . 
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On the other hand, the transformation for Put_el_merge 
is obtained as 

Fi(si, t)=c(t/2) 

and 

5 F2 (S2, t) =c (l-t/2) . 

Embodiments 

The preferred embodiments of the present invention will 
next be described in view of the aforementioned base 

10 techniques. 

Regarding the base technique [1] , the Morse theory, 
which provides information concerning singular points and 
indices only, can show conceptually the type of cells 
constituting a solid. According to Morse theory, the 3D shape 

15 of an object can be conceptually reconstructed from basic 

singular point information, that is, singular points, their 
indices, and the sequence of the singular points. However, 
Morse theory fails to present a geometrical relationship in 
which those cells are to be connected to one another. 

20 The Reeb graph helps to resolve this problem of Morse 

theory. The Reeb graph shows the connection between the cells 
and can provide information regarding the connectivity 
relationship between the singular points, which are 
represented by nodes of the graph. Thus, by combining Morse 
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theory with the Reeb graph, the 3D shape of an object can be 
recognized in a more detailed manner. In a particular 
example, existing polygon data representing the shape of the 
object can be obtained, information relating to singular 
5 points of the Morse function can be extracted, and then 

information relating to the connection between the singular 
points can be obtained. In principle, the shape of the object 

a can then be understood based on the above extracted and 

:|l obtained information. 

^"^'^10 Next, it is understood that only the structure or the 

framework of the object can be reconstructed using Morse 
theory and the Reeb graph. Namely, only overall topological 

1:=== information can be obtained. Therefore, additional 

geometrical information may be necessary to understand the 
15 more precise geometrical shape of the object. For this 
reason, contour information may also be obtained. 

A significant difference between the base techniques and 
the present embodiment is the Morse function adopted. The 
base techniques utilize a height function as the Morse 
20 function, which returns the value of the z-axis coordinate. 
On the other hand, the present embodiment adopts a graph 
distance as the Morse function. This function returns the 
number of polygon edges on the shortest path that reaches a 
vertex under consideration from a base vertex that is defined 



35 



on the original polygon object. A Reeb graph can also be 
determined for this Morse function and can be generated 
automatically . 

In the base techniques, when a solid is encoded using 
5 the techniques in section [1] , a surface can be created for it 
using the techniques in section [2]. In general, section [1] 
utilizes the concept of homotopy type, while section [2] 
□ utilizes the concept of a homotopy function. 

'■S According to the techniques in sections [1] and [2] , 

homotopic connection between respective critical sections can 

^5 be used to re-create the surface of the solid. This is 

,^ because the topology remains the same between critical 
sections, and a homotopic connection keeps the topology 

13 unchanged. In other words, the critical sections are the 
15 sections of the necessary and sufficient number of divisions 
so that the surface of the object can be re-created by a 
continuous function like the homotopy function. 

A critical section corresponds to a node of the Reeb 
graph and the space between the critical sections corresponds 
20 to an edge of the graph. The nodes corresponding to cells e^ 
and e° are topologically two-dimensional cells that are 
equivalent to a two-dimensional open disc. Note that the 
cells e^ and e° are singular points that are not degenerate in 
the base techniques, but in the embodiment a singular surface 
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may be degenerate. However such degeneration can be avoided 
by providing an offset onto the singular surface. 

A real object contains a surface, which is topologically 
equivalent to a cylinder, around each edge of the Reeb graph. 
5 Each of the edge topologically corresponds to an annulus, 
which is a name for a cylinder in topology. Therefore the 
Reeb graph of the object is obtained, and thereby the object 
will be decomposed into components including two-dimensional 
i,0 cells and annuli in the embodiment. Once a base vertex is 
_10 defined on the object, the Reeb graph is uniquely determined. 

Therefore any object can be decomposed into the components 
;^ constantly and automatically. This is the merit obtained when 

L=i the base techniques are utilized. As described later, since 
□ the two-dimensional cells and annuli are all in the shape of 
15 triangle strips, they can be encoded easily. 

In the embodiment, an object encoding is divided into 
two stages: encoding connection between polygon vertices and 
encoding geometrical information of the polygon vertices. 
This is because the base techniques aim at the generation of a 
20 smooth surface of the object, while the embodiment aims at the 
compression of the polygon data. A graph distance is adopted 
as the Morse function because it can be highly compatibly 
applied to the polygon data. 

First, a general concept used in the embodiment is 
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explained and procedures of the embodiment are summarized. 
Shape components 

Fig. 17 illustrates shape components in the embodiment. 
Figure 17 shows a three-dimensional object having an "H" 
5 shape . 

A contour of a cross-section on a closed surface M is 
defined as a connected isovalue line when a continuous 
function f: R is defined on the surface. In the 
embodiment, a graph distance from a base vertex is used as the 

10 function f. The first shape component is a contour 

represented as a graph structure called the c-graph. A c- 
graph is composed by sets of nodes and edges, denoted as c- 
nodes 10 and c-edges 14, respectively. The vertices on a 
single c-graph have the same graph distance from the base 

15 vertex. Note that the graph distance is converted into a 
height direction in Figure 17 for ease of understanding. 

A c-graph can be composed by only a single c-node 10. 
In this case, c-node 10 is a peak or a pit that is a singular 
point of the Morse function f. If a c-graph is in a loop 

20 shape, a single c-node 10 can be provided on the loop for the 
sake of convenience and it is considered that there exists a 
c-edge 14 having the c-node 10 as an endpoint . Namely, if a 
c-graph is a loop, the c-graph has one c-node 10 and one c- 
edge 14 in principle. 
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If a c-graph. is in an "8" shape, there exists one c-node 
10 at its cross point, which is a saddle singular point. Each 
of the two loops of the c-graph is a c-edge 14. 

In the embodiment can ensure validity as an orientable 
5 closed surface can also be ensured. The Euler equation is 
satisfied in the same way as for a boundary representation: 

# (c-node) - # (c-edge) + #(2-cell) = 2 - 2 g 
where # denotes the number of elements, and g is the genus of 
the closed surface. In the case of Fig. 17, the above 
10 equation is 6-5+1=2-0. Thus, the encoding apparatus 
of the embodiment ensures that an object is a closed surface 
and its encoding is valid by using this equation. 

Other shape components are an annulus 16 and a 2 -cell 
18. An annulus has two different c-graphs or parts thereof as 
15 its boundaries, and it is topologically equivalent to a 

cylinder. A 2 -cell has only one boundary composed of a c- 
graph or part thereof, and is equivalent to an open disc 
topologically . 

The 2 -cell and the annulus are extracted from the 
20 triangle mesh of an original object based on a graph distance 
so that they can take a form of a triangle strip as described 
later. A triangle strip is defined as a linear triangle 
stream stretching in one direction in which connectivity can 
be represented as a bit stream as shown in Fig. 18. This bit 
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stream is called a marching pattern. In the Figure, the 
marching pattern is "01001011" . 

The graph distance is defined for each vertex. The 
graph distance for a vertex represents the number of edges 
5 that compose a shortest path to the vertex from a base vertex 
and is calculated using Dijkstra's algorithm. Here, it is 
important to note that the difference between a graph distance 
O value for a vertex and that for an adjacent vertex must be 

zero or one. That is, this graph distance function is a 
^MlO continuous function of a positive integer on a surface, and 
;S can therefore be applied to the homotopy model. 

The fundamental shape components have been explained so 
i'= far. In the embodiment, an open annulus is also defined in 
O order to handle an open mesh. An open mesh is a mesh that has 
15 an open edge, that is, an edge at which a triangle exists on 
only one side of the edge. In order to allow for this 
situation, the open annulus takes the form of an acyclic 
triangle strip, and its initial and terminal edges correspond 
to the open edges. This can be compared to the ordinary 
20 annulus described above, circular annulus, in which the 
triangle strip is circular. Although an open annulus is 
topologically equivalent to a disc, the boundaries consist of 
two different c-graphs (or parts thereof) as well as a 
circular annulus. Whether an annulus is circular or open can 
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be determined by traversing the triangles in the annulus in 
one direction from a starting triangle. If the traversing 
returns to the starting triangle, the annulus is circular, 
otherwise open. Note that the open annulus is effective for 
5 encoding a non-manifold. 

Constructing shape components 

The first step in extracting the shape components is the 
C3 classification of each of the triangles, edges, and vertices 
^fl in a triangle mesh. A triangle in a mesh may be included on a 
-^10 2 -cell or on an annulus. If the three vertices of the 
=JJ triangle have the same graph distance value, the triangle is 
on a 2 -cell, otherwise it is on an annulus. 

An edge may belong to any of three components; it may be 
O an edge on a c-graph, an edge on a 2 -cell, or an edge on an 
15 annulus. An edge is classified using the fallowing steps: 

Step 1. If the graph distance values of the end vertices of 
the edge are different, it is on an annulus, otherwise go to 
step 2 . 

Step 2 . If there is an edge of an annulus among the four 
20 adjacent winged edges, it is on a c-graph, otherwise go to 
step 3 . 

Step 3. If at least one triangle among the triangles adjacent 
to the edge has two edges that are already classified as on a 
2-cell, it is on a c-graph, otherwise it is on a 2-cell. 
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A vertex in a mesh corresponds to either a c-node or a 
vertex on a c-edge. The vertex is classified according to the 
attributes of the adjacent edges of the vertex. If the number 
of c-edges incident to the vertex is two, the vertex is on a 
5 c-edge, otherwise it corresponds to a c-node. 

Step 3 is a necessary condition for a 2-cell to be in a 
triangle strip shape. If the existing three edges of a 
O triangle were all on a 2-cell, the 2-cell could be branched 
out at the triangle and it could not be a triangle strip. 
Once the elements of the mesh are classified, the 
!= annuli, 2 -cells, and c-graphs are constructed by grouping the 

classified triangles, edges and vertices. Thus, edges on a c- 
^=1= graph that are connected compose one c -graph, a triangle and 
Q its adjacent triangle on an annulus are integrated into one 

15 annulus if they share a common c-edge on an annulus, and so 
on. The annuli and 2 -cells always take the form of triangle 
strips when constructed in this manner as shown in Fig. 19 and 
Fig. 20. 

The c-graphs can be decomposed to c-nodes and c-edges by 
20 using the classification of vertices and edges described 
above . 

The embodiment compresses the topology of a mesh based 
on a continuous function of the graph distance and the 
particular function used sometimes has a great influence on 
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the compression ratio. The function depends on which vertices 
of the polygon mesh are chosen as the base vertices. The base 
vertices may be specified by a user or may be set 
automatically. Such automatic setting can utilize geometrical 
5 information. For instance, the vertex whose z-axis value is 
the maximum or a cusp of the polygon mesh may be automatically 
set. Global topological information may be also used. For 
instance, the vertex where a branch occurs may be set as a 

'B base vertex. In any cases the graph distance value on the 

^-10 base vertex is zero. 

Fig. 21 shows the shape components constructed when one 
base vertex 20 is specified. Fig. 22 shows the shape 

hA components constructed when several base vertices 2 0 are 

C3 specified. In both cases a torus is used as the object. 

15 In Figs. 21 and 22 the black lines are c-graphs. In 

this example, the method used for Fig. 22 provides twenty 
percent higher compression than that for Fig. 21. This 
improvement is because the selection of a continuous function 
suitable for the structure of the polygon mesh improves a 
20 prediction procedure for geometry compression that is 
described later. 

Lossless connectivity compression 

The member variables of each shape component are now 
compressed by Huffman coding and the marching pattern of the 
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triangle strip, that is, the connectivity information of the 
polygon mesh are encoded as a bit stream. In the case of this 
compression method, the original marching pattern can be 
completely reconstructed by a decoding process and therefore a 
5 lossless compression can be realized. 

Touma et al . propose another lossless connectivity 
compression method in a paper "Triangle Mesh Compression" , 
P Graphics Interface '98 Proceedings, pp. 26-34, 1998. The 
^O- proposed encoding method is to describe the total number of 
■^^10 other vertices connected with one vertex in a predefined 
order. 

Figs. 23(a) to (t) show the procedure of Touma' s 
|,=i= encoding method. In these Figures, the bold lines are called 
Q an active list. The active list partitions the mesh into an 

15 "outer" part containing edges not yet encoded, and an "inner" 
part containing edges already encoded. The broken lines 
indicate edges already encoded. 

In this method, a polygon mesh is first input (Fig. 
23(a)) . Next, a dummy vertex is added and connected to all 
20 boundary vertices (Fig. 23(b)) . This is because Touma' s 

method assumes a closed surface. Next an initial triangle is 
selected, and a focus vertex, sometimes simply referred to as 
a focus, is marked (Fig. 23(c)) . Since the number of vertices 
connected with this focus is six, a code word "add 6" is 
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generated. With the focus unmoved, a next vertex is 
considered in counter-clockwise order and a code word "add 7" 
is generated. Likewise "add 4" is generated for the last 
vertex in the selected triangle. 
5 Next, the active list is expanded in counter-clockwise 

order around the focus (Fig. 23 (d) ) . A code word "add 4" is 
generated for a newly added vertex in the expanded active 
list. Likewise, "add 8" (Fig. 23(e)), "add 5" (Fig. 23(f)), 

ifl and "add 5" (Fig. 23(g)) are generated. All the edges are now 

=10 encoded for this focus. 

The focus is moved on along the active list in counter- 
clockwise order (Fig. 23(h)), "add 4" (Fig. 23(1)) and "add 5" 
(Fig. 23 (j)) are generated. Since the next free edge of the 

13 focus leads to a vertex already in the active list, the active 

15 list is split into two separate active lists (Fig. 23 (k) ) . A 
code word "split 5" is now generated. Here "split" indicates 
the split of the active list, and "5" is an offset indicating 
the number of the unencoded edges along the active list in 
clockwise order around the focus. The smaller one of the 
20 separate active lists is pushed on the stack for future 

treatment, and the encoding procedure proceeds with the second 
one. The focus is moved on (Fig. 23(1)), and "add 4" (Fig. 
23 (m) ) and "add 4" (Fig. 23 (n) ) are generated. The encoding 
procedure is completed for the focus. Next, the dummy vertex 
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is considered and ''add dummy 6" is generated (Fig. 23 (o) ) . 
The larger active list is completed for encoding and the 
smaller active list is popped from the stack (Fig. 23 (p) ) . 
For a new vertex in the active list, "add 4" is generated (Fig . 
5 23 (q) ) . At this point all the edges are encoded. The focus 
is moved on (Fig. 23 (r) , 23 (s) ) , and then the smaller active 
list is completed for encoding (Fig. 23 (t)) . The resulting 
code is "add 6, add 7, add 4, add 4, add 8, add 5, add 5, add 
4, add 5, split 5, add 4, add 4, add dummy 6, add 4". 
LO In the embodiment, Touma's encoding method may be 

applied to each triangle strip. However, Touma's method 
cannot deal with a non-manifold. Another representation 
method according to the present invention may be used for 
r encoding a non-manifold. A non-manifold means an open 

15 surface, for instance when a single edge of the polygon mesh 
is shared by three triangles. 

The method of the present invention represents the 
number of triangles around a vertex. A basic representation 
looks like ''3 (2, 4, 3)". This representation indicates that 
20 there are three sets of triangles around the vertex and these 
sets are composed of two triangles, four triangles, and three 
triangles respectively. In the case of a manifold, there is 
only one set of triangles around a vertex and the generated 
code is simply "add 6" or the like in Touma's method. On the 
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Other hand, the representation of the present invention allows 
a non- manifold to be encoded. 

Figs. 24 and 25 show a part of the surface of a non- 
manifold object. As shown in Fig. 24, a polygon fragment 3 0 
5 exists over a triangle strip 40 that is composed of c-edge 32 
and c-edge 34, the graph distance values of which are m and 
m+1 respectively. In this case, the vertices 36 and 38, at 
which the fragment 3 0 and the triangle strip 4 0 intersect, are 
defined as new c-nodes. The edge 43 between the vertices 36 

^yiO and 38 divides the triangle strip 40 into two regions 

indicated by s and t. The separate triangle strips are 
treated as open annuli. In the Figure the fragment 30 itself 
is encoded as an open annulus . The vertices 36 and 38 are c- 

O nodes and each of them has three sets of triangles around it . 
15 Therefore the code will take the form of "3 (a, b, c) " . As 

shown in Fig. 25, if a polygon fragment 3 0 exists in parallel 
with a triangle strip 40, a new c-node is not provided. 
Likewise, a code for a vertex 46 will take the form of "3 (a, 
b , c ) " . 

20 In the method of this embodiment, Touma's operators such 

as add and split are not used, but four operators based on 
shape components are introduced as follows: 

NEW_CNODE : Create a new c-node and move a focus to the 
c-node ; 
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OLD_CNODE : Move to an existing c-node; 

NEW_CEDGE : Create a new c-edge and move to the c-edge; 

and 

OLD_CEDGE : Move to an existing c-edge. 
5 The operators "NEW_CNODE" and "NEW_CEDGE" take as an 

argument information relating to a set of triangles such as "3 
(2, 4, 3)". The operators ''OLD_CNODE" and "OLD_CEDGE" take as 
an argument the index of the c-node or the c-edge. The non- 
manifold is thus encoded by the invented method. 

LO Lossy connectivity compression 

The connectivity information may also be encoded in a 
lossy way. In the embodiment a toroidal graph representation 
is introduced to achieve this lossy compression of 
connectivity. The toroidal graph was first proposed by Fuchs 

15 et al . , "Optimal Surface Reconstruction from Planar Contours", 
Communications of the ACM, 20(10), pp. 693-702, October, 1977. 
This toroidal graph is used to generate a triangle mesh 
between two cross - sect ions . 

Fig. 26 illustrates the reconstruction of the surface of 

20 an annulus or a 2 -cell by constructing a toroidal graph. An 
annulus has two boundaries which consist of the vertex arrays 
(Po, Pi, ... , Pm-i) and (Qo, Qi, ... , Qn-i ) respectively where 
the size of the marching pattern is m + n. In the case of a 
regular annulus, Po = Pm-i and Qo = Qn-i hold. In the 
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embodiment, it is assumed that there are polygon edges 
connecting Pq - Qo a.nd Pm-i - Qn-i ■ Then the toroidal graph has 
m by n nodes which are denoted ti,j (0<=i<m, 0<=j<n) and each 
node ti^j in the toroidal graph means the existence of a 
5 polygon edge between the vertex Pi and vertex Qj . Each ti,j 

has two directional edges to adjacent nodes and the edge (ti,j 
ti+i,j) indicates the triangle consists of (Pi, Pi+i, Qj) , and 
i;3 the edge (ti,j ti^j+i) indicates (Pi, Qj+i, Qj) . Each edge can 
:.0 be assigned a cost calculated based on the area of the 
^^10 triangle or the angle between the adjacent faces. For 
\^ instance, a smaller cost may be assigned to a triangle with a 
smaller area or a triangle with a smaller angle to a previous 
adjacent face. 

1.3 By constructing the toroidal graph in this way, the 

15 reconstruction of a marching pattern can be reduced to the 

problem of searching the minimum cost path from to,o to tm-i,n-i- 
The minimum cost path can be solved by calculating the minimum 
cost of each node in order according to Dijkstra's algorithm: 

tl,0 -> to,l -> t2,0 ti,i to, 2 . . . ^ tn,-i,n-2 ^ t^- 

20 2,n-l tm-l.n-1- 

Furthermore, Dr. Shinagawa proposed an extended toroidal 
graph theory in the dissertation that was introduced as the 
base techniques. His method is as follows: 

1. a pair of points that are mutually closest are 
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detected between two contour lines; 

2. the closest pair are connected by an edge; and 

3. the other corresponding points are obtained by 
interpolating between the pair of points. 

5 Fuchs' method obtains an optimal path in a discrete 

manner, but Shinagawa's method is a continuous version of 
Fuchs' method and can alleviate artificial "wrinkles" on the 
reconstructed surface that often appears in Fuchs' method. 
The embodiment may use this extended toroidal graph. 

10 Geometry compression 

The embodiment compresses vertex coordinates based on an 
approach proposed in "Geometric compression through 
topological surgery", ACM Transactions on Graphics, 17(2), pp. 
84-115, April, 1998 by Taubin. Generally, the coordinate 

15 values are represented as 32bit floating-point numbers. 

However, since vertices in a triangle mesh exist in only a 
limited space, this representation is wasteful. Therefore, 
considering a bounding box surrounding the object, the float 
value fi of coordinates of the vertex vi can be rounded to a 

20 fixed length value ni . The rounded fixed lengths bx, by and bz 
are defined for x, y and z coordinates respectively. In 
experiments, it was found that b^, by, of between 9 and 12 
bits are sufficient to maintain the quality of meshes that 
contain fewer than 100,000 triangles. The embodiment also 
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employs a prediction scheme to provide further compression of 
the rounded values of the fixed length integers . In the 
encoding of a value of a vertex, the error between the 
original value and the predicted value calculated from the 
5 preceding vertices is coded by Huffman encoding which is an 
entropy coding. 

Although the aforementioned method works well and 
Q produces a high compression ratio, it does not take adaptivity 
=;D of the triangle mesh structure into account. When a mesh has 
^^^10 fine and coarse triangles, it is called an adaptive mesh. If 
I'i the fine parts of the adaptive mesh take on importance, the 

rounding to fixed length integer is made at the expense of the 
1,=.!= resolution of the fine parts. In this case, the errors 
O between the original and the prediction values in the coarse 

15 parts vary widely, and as a result the performance of the 
Huffman encoding is degraded. 

Fig. 27 shows connectivity prediction and compression 
methods for the adaptive mesh, which are intended to solve the 
above problem. In Figure 27, ni-2/. ni_i, ni respectively denote 
20 positions defined after the real positions of vertices Vi_2, 
Vi-i, Vi are rounded. Pi denotes the position of ni linearly 
predicted from ni-2 and ni_i, and Si denotes the error between 
the predicted Pi and a reference point nj-i that is explained 
later. 
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The embodiment assigns an allowance range 60 of 2ai to 
the rounded value ni of the vertex vi, that is, the decoded 
vertex is allowed to take any value in the range [ni - ai, ni + 
ai] . The range ai is larger when the vertex vi is in a coarse 
5 part, and is smaller when the vertex vi is in a fine part. 

Thus, the compression of an adaptive mesh is reduced to 
the problem of how to decide the error Si that has as small a 
Huffman code as possible while satisfying the condition Pi + Si 
G [ni - ai, ni + ai] so that the reference point n^i (being Pi + 
LO Si) can be within the allowance range 60. In order to solve 
this problem, a step number Si based on ai is defined as 
follows: 

1:5 Si = 2^ 

where p is the maximum integer value that does not exceed 
15 log22ai. The step number Si takes the form of 2^ where n is an 
integer value and is the largest number below the allowance 
range 2ai. The Si is then calculated using the Si as follows: 
Si = eiSi, where ei e N is a unit vector. 

Such Si values are uniquely determined and the Si is then 
20 encoded by Huffman encoding. Note that the step number Si can 
be regarded as the resolution of each mesh and therefore the 
Huffman encoding based on all of the steps Si is equivalent to 
encoding each value at different resolution. Thus, effective 
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compression for an adaptive mesh can be achieved. 

In the compression of the vertex coordinates, the 
allowance range parameter ai is calculated using the normal 
vector normi of the vertex vi, the minimum value Imini of the 
5 distances to adjacent vertices, and an adaptive parameter a 
specified by a user as follows: 
ai = (Imini/a) | normi ex|y]z| 
where the parameter ai is calculated for x, y and z 
coordinates individually and the ex|y|z denotes the unit 

10 vectors at each coordinate. 

The adaptive parameter a is greater than zero so that 
the parameter ai becomes larger as the a becomes smaller. 
That is, a small a makes the compression sensitive to the 
adaptivity of a mesh and provides a high compression ratio but 

15 the decoded object is constructed with poorer quality. The 
normal vector normi is used to restrain the vertex from 
shifting in the normal direction because a normal shift is 
more perceptible than an orthogonal shift, especially for 
vertices on a nearly flat surface. 

20 According to our experiments, an adequate value for a is 

8 or 16 or more, and for more bumpy mesh data, occasionally a 
value of about 4 is sufficient. This is because shifts of 
coordinates in a bumpy mesh are less conspicuous than in a 
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smooth mesh. Such features of a mesh should be considered 
when the adaptive parameter a is decided. 

Note that the aforementioned method is only one example 
of the ways to obtain the parameter ai . Furthermore, this 
5 adaptive mesh approach can be applied to other geometrical 
information such as texture coordinates, normal vectors, 
colors, and reflection property. 
Encoding and decoding procedures 

The embodiment uses the following input parameters for 
10 encoding a mesh and controlling the quality and compression 
ratio : 

V = {vii, Vi2, ... }: the array of base vertices; 

bx, by, bz : the rounding bit length of each coordinate; 

a: the adaptivity parameter. 
15 For encoding topology information, the input parameter v 

is used. Although the parameter v sometimes affects the 
compression ratio, it is usually sufficient to simply specify 
an arbitrary vertex. For encoding geometrical information, 
the input parameters b^, by, bz and a (>0) are used. 
20 Once the input parameters have been specified, a 

triangle mesh is encoded through the following steps: 

Step 1: Construction of shape components. 

In this step, a base vertex v is specified and a 
continuous function relating to graph distance is determined. 
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Then an object is decomposed into shape components. 
Step 2: Geometry compression. 

The coordinate values are rounded to bx, by and bg bits, 
and then compressed by the adaptive prediction scheme using 
5 the parameter a. 

Step 3: Topology compression. 

The marching patterns are compressed using toroidal 
J graphs . 

■'i It is important that the geometry compression is 

|~10 performed before the marching pattern compression because the 
ft computation of toroidal graphs depends on the coordinate 
y= values of each vertex. In the decoding phase the toroidal 

graph cannot refer to the original geometry values, but refers 
== to the encoded values only. Therefore it is necessary to use 
15 the encoded geometry values in the encoding phase. 

The decoding follows the same steps as the encoding 
process; i.e., (1) decoding the shape components, (2) decoding 
the geometry values, and (3) decoding the marching patterns. 
In order to test the effectiveness of the embodiment, 
20 polygon objects of 20 to 900 kilobytes in VRML 1.0 (ASCII 

form) were processed. In this experiment, the amount of data 
is reduced by one to two digits while the degradation of the 
image quality is almost imperceptible. The time required for 
encoding and decoding was fairly fast, ranging from 0.5 to 100 
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seconds. The experiment was performed using an MMX Pentium'^'^ 
2 66MHz personal computer with a Linux operating system. The 
connectivity information using the toroidal graph was 
compressed into less than one bit. The number of bits per 

5 triangle was less than 0.2 in the best case. 

In the embodiment, it is not necessary to wait until the 
completion of decoding in order to display an object, since 
the decoded parts can be displayed sequentially. Therefore 
the embodiment is suitable for a progressive display. Now 

0 that the general concept and procedures of the embodiment have 
been described, the structure of the embodiment is now 
explained . 

Fig. 2 8 shows the structure of an object encoding 
apparatus 100 according to an embodiment of the invention. An 
5 object obtaining unit 104 receives the polygon mesh of an 

object from a polygon object storage 102, which may be local 
storage or on a network or the like. A function setting unit 
106 sets a function relating to a graph distance on a surface 
defined by the input polygon mesh. The function setting unit 
20 106 includes an origin defining unit 108 that defines a base 
vertex (or base vertices) as a starting point or an origin 
from which the graph distance is defined, and a distance 
calculating unit 110 that calculates a graph distance from the 
base vertex to each vertex on the polygon mesh. 
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A shape component decomposing unit 112 decomposes the 
polygon mesh into its shape components, and an encoding unit 
114 encodes each of the shape components. Since the function 
relating to the graph distance can be regarded as a Morse 
5 function described in the base techniques above, the encoded 
data of each shape component includes global structure or 
topology information of the object in the form of a Reeb 
graph. The encoding unit 114 includes a geometry compression 
,Q unit 116 that compresses and encodes the geometry information 

yiO of the polygon mesh, and a topology compression unit 118 that 
J compresses the connectivity information of the polygon 

vertices. An encoded data output unit 12 0 outputs the encoded 
data to an encoded data storage 122, which may be local 
3 storage or on a network or the like. 
15 Fig. 2 9 shows the structure of an object decoding 

apparatus 200 according to the embodiment. An encoded data 
obtaining unit 202 receives the encoded data of an object from 
an encoded data storage 122, which may be local storage or on 
a network or the like. A shape component extracting unit 204 
20 extracts the shape components of the object from the input 
encoded data. A decoding unit 206 decodes each of the 
extracted shape components. The decoding unit 206 includes a 
geometry decoding unit 2 08 that reconstructs the geometry 
information of the polygon mesh, and a topology decoding unit 
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210 that reconstructs the connectivity information of the 
polygon vertices. A decoded data output unit 212 outputs the 
decoded data to a decoded data storage 214, which may be local 
storage or on a network or the like. 
5 The encoding and decoding procedures performed by the 

elements in the above configuration have been described 
herein. Note that the encoding and decoding procedures of the 
present invention can also be recorded as a computer program 
and stored in various storage media for distribution, or such 

10 a program can be provided via various transmission methods. 

For the object encoding and decoding of the embodiment, 
various modifications can be applied. First, the embodiment 
assumes that an object is represented in a polygon mesh but 
this is not intended to be a restriction. For instance, the 

15 object may also be represented as a free surface or 

represented analytically. The object may also be represented 
using a boundary representation method or represented in 
constructive solid geometry (CSG) . The present invention can 
also be applied to such object representations. In these 

20 cases, a starting point is provided on the surface of the 

object and the function is defined on the surface. The shape 
components can also be generated based on the function values 
and encoded. 

Secondly, in the embodiment, the starting point or the 
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origin is provided on the surface of the object, but the 
origin may also be taken from outside or inside of the object. 
In this case, a distance from the origin to each point can be 
adopted for the function instead of a graph distance. 
5 Note also that each of the elements of the procedures 

and of the encoding apparatus and decoding apparatus discribed 
above may be realized by software and/or hardware. 

Although the present invention has been described by way 
of exemplary embodiments, it should be understood that those 
10 skilled in the art might make many changes and substitutions 

without departing from the spirit and the scope of the present 
invention that is defined by the appended claims. 
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